The residual-based predictiveness curve: A visual tool to assess the performance of prediction models.

Authors

  • Giuseppe Casalicchio
  • Bernd Bischl
  • Anne-Laure Boulesteix
  • Matthias Schmid
Abstract

It is agreed among biostatisticians that prediction models for binary outcomes should satisfy two essential criteria: first, a prediction model should have a high discriminatory power, implying that it is able to clearly separate cases from controls. Second, the model should be well calibrated, meaning that the predicted risks should closely agree with the relative frequencies observed in the data. The focus of this work is on the predictiveness curve, which has been proposed by Huang et al. (Biometrics 63, 2007) as a graphical tool to assess the aforementioned criteria. By conducting a detailed analysis of its properties, we review the role of the predictiveness curve in the performance assessment of biomedical prediction models. In particular, we demonstrate that marker comparisons should not be based solely on the predictiveness curve, as it is not possible to consistently visualize the added predictive value of a new marker by comparing the predictiveness curves obtained from competing models. Based on our analysis, we propose the "residual-based predictiveness curve" (RBP curve), which addresses the aforementioned issue and which extends the original method to settings where the evaluation of a prediction model on independent test data is of particular interest. Similar to the predictiveness curve, the RBP curve reflects both the calibration and the discriminatory power of a prediction model. In addition, the curve can be conveniently used to conduct valid performance checks and marker comparisons.
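The two curves discussed in the abstract can be illustrated with a minimal sketch. It assumes that the predictiveness curve is drawn as the ordered predicted risks against their rank fraction, and that a residual-based variant orders the residuals (observed outcome minus predicted risk) in the same way; the abstract does not spell out either construction. The function names and the toy data are hypothetical and not taken from the paper.

```python
# Illustrative sketch only: function names and data are hypothetical,
# and the residual-based construction is an assumption, not the authors' code.

def predictiveness_curve(risks):
    """Points (v, R(v)): predicted risks sorted ascending against rank fraction v."""
    ordered = sorted(risks)
    n = len(ordered)
    return [((i + 1) / n, r) for i, r in enumerate(ordered)]


def residual_curve(outcomes, risks):
    """Points (v, e(v)): residuals y - p sorted ascending against rank fraction v.

    For binary outcomes and risks in [0, 1], controls (y = 0) give residuals <= 0
    and cases (y = 1) give residuals >= 0, so controls fall on the left of the
    curve and cases on the right.
    """
    residuals = sorted(y - p for y, p in zip(outcomes, risks))
    n = len(residuals)
    return [((i + 1) / n, e) for i, e in enumerate(residuals)]


def mean_residual(outcomes, risks):
    """Observed event rate minus mean predicted risk (near 0 if well calibrated overall)."""
    return sum(outcomes) / len(outcomes) - sum(risks) / len(risks)


# Toy data: 1 = case, 0 = control, with made-up predicted risks.
outcomes = [0, 0, 1, 0, 1, 1]
risks = [0.1, 0.2, 0.7, 0.4, 0.8, 0.6]

pc = predictiveness_curve(risks)
rc = residual_curve(outcomes, risks)
gap = mean_residual(outcomes, risks)
```

Under these assumptions, residuals close to zero on both sides of the curve indicate good separation of cases from controls, and a mean residual close to zero indicates agreement between predicted risks and the observed event rate. Because the residuals need only outcomes and predicted risks, the same functions can be applied to independent test data.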

Similar resources

Evaluation of Various Approaches in Prediction of Daily and Lactation Yields of Milk and Fat Using Statistical Models in Iranian Primiparous Holstein Dairy Cows

In this research, 272977 test day records collected from 659 herds during years 2001 to 2011 by the Iranian animal breeding center were used. In the first section the ability of different models to predict daily milk yield from alternative milk recording was tested. The result showed that a complex model including noon milking time plus the effect of lactation curve of Ali and Schaeffer functio...

Predictiveness curves in virtual screening

BACKGROUND In the present work, we aim to transfer to the field of virtual screening the predictiveness curve, a metric that has been advocated in clinical epidemiology. The literature describes the use of predictiveness curves to evaluate the performances of biological markers to formulate diagnoses, prognoses and assess disease risks, assess the fit of risk models, and estimate the clinical u...

Semiparametric methods for evaluating the covariate-specific predictiveness of continuous markers in matched case-control studies.

To assess the value of a continuous marker in predicting the risk of a disease, a graphical tool called the predictiveness curve has been proposed. It characterizes the marker's predictiveness, or capacity to risk stratify the population by displaying the distribution of risk endowed by the marker. Methods for making inference about the curve and for comparing curves in a general population hav...

Comparison of Gestational Diabetes Prediction Between Logistic Regression, Discriminant Analysis, Decision Tree and Artificial Neural Network Models

Background and Objectives: Gestational Diabetes Mellitus (GDM) is the most common metabolic disorder in pregnancy. In case of early detection, some of its complications can be prevented. The aim of this study was to investigate early prediction of GDM by logistic regression (LR), discriminant analysis (DA), decision tree (DT) and perceptron artificial neural network (ANN) and to compare these m...

Pavement performance prediction model development for Tehran

Highways and in particular their pavements are the fundamental components of the road network. They require continuous maintenance since they deteriorate due to changing traffic and environmental conditions. Monitoring methods and efficient pavement management systems are needed for optimizing maintenance operations. Pavement performance prediction models are useful tools for determining the op...

Journal:
  • Biometrics

Volume 72, Issue 2

Pages: -

Publication date: 2016